Operant Conditioning in Skinnerbots

نویسندگان

  • David S. Touretzky
  • Lisa M. Saksida
چکیده

Instrumental (or operant) conditioning, a form of animal learning, is similar to reinforcement learning (Watkins, 1989) in that it allows an agent to adapt its actions to gain maximally from the environment while only being rewarded for correct performance. But animals learn much more complicated behaviors through instrumental conditioning than robots presently acquire through reinforcement learning. We describe a new computational model of the conditioning process that attempts to capture some of the aspects that are missing from simple reinforcement learning: conditioned reinforcers, shifting reinforcement contingencies, explicit action sequencing, and state space re nement. We apply our model to a task commonly used to study working memory in rats and monkeys: the DMTS (Delayed Match to Sample) task. Animals learn this task in stages. In simulation, our model also acquires the task in stages, in a similar manner. We have used the model to train an RWI B21 robot.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The operant and the classical in conditioned orientation of Drosophila melanogaster at the flight simulator.

Ever since learning and memory have been studied experimentally, the relationship between operant and classical conditioning has been controversial. Operant conditioning is any form of conditioning that essentially depends on the animal's behavior. It relies on operant behavior. A motor output is called operant if it controls a sensory variable. The Drosophila flight simulator, in which the rel...

متن کامل

Treatment of Children’s Aggression by Behavioral Therapy Techniques

Objective: The present study aims at investigating the effects of behavioral therapy techniques through operant conditioning and observational learning on children’s aggression aged 4-6 years.  Methods: To this end, a pretest-posttest quasi-experimental study with two experimental groups and a control group was designed. We used non-probability purposive sampling method to select 45...

متن کامل

Molecular Mechanisms Underlying a Cellular Analog of Operant Reward Learning

Operant conditioning is a ubiquitous but mechanistically poorly understood form of associative learning in which an animal learns the consequences of its behavior. Using a single-cell analog of operant conditioning in neuron B51 of Aplysia, we examined second-messenger pathways engaged by activity and reward and how they may provide a biochemical association underlying operant learning. Conditi...

متن کامل

Neural Operant Conditioning as a Core Mechanism of Brain-Machine Interface Control

The process of changing the neuronal activity of the brain to acquire rewards in a broad sense is essential for utilizing brain-machine interfaces (BMIs), which is essentially operant conditioning of neuronal activity. Currently, this is also known as neural biofeedback, and it is often referred to as neurofeedback when human brain activity is targeted. In this review, we first illustrate biofe...

متن کامل

Operant reward learning in Aplysia: neuronal correlates and mechanisms.

Operant conditioning is a form of associative learning through which an animal learns about the consequences of its behavior. Here, we report an appetitive operant conditioning procedure in Aplysia that induces long-term memory. Biophysical changes that accompanied the memory were found in an identified neuron (cell B51) that is considered critical for the expression of behavior that was reward...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Adaptive Behaviour

دوره 5  شماره 

صفحات  -

تاریخ انتشار 1997